Data and text mining Peak Selection from MALDI-TOF Mass Spectra Using Ant Colony Optimization
نویسندگان
چکیده
Motivation: Due to the large number of peaks in mass spectra of low-molecular-weight (LMW) enriched sera, a systematic method is needed to select a parsimonious set of peaks to facilitate biomarker identification. We present computational methods for matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) spectral data preprocessing and peak selection. In particular, we propose a novel method that combines ant colony optimization (ACO) with support vector machines (SVM) to select a small set of useful peaks. Results: The proposed hybrid ACO-SVM algorithm selected a panel of 8 peaks out of 228 candidate peaks from MADLI-TOF spectra of LMW enriched sera. An SVM classifier built with these peaks achieved 94% sensitivity and 100% specificity in distinguishing hepatocellular carcinoma from cirrhosis in a blind validation set of 69 samples. Area under the ROC curve was 0.996. The classification capability of these peaks is compared with those selected by the SVM-recursive feature elimination method.
منابع مشابه
Peak selection from MALDI-TOF mass spectra using ant colony optimization
MOTIVATION Due to the large number of peaks in mass spectra of low-molecular-weight (LMW) enriched sera, a systematic method is needed to select a parsimonious set of peaks to facilitate biomarker identification. We present computational methods for matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) spectral data preprocessing and peak selection. In particular, we propose a ...
متن کاملA Designed Experiments Approach to Optimization of Automated Data Acquisition during Characterization of Bacteria with MALDI-TOF Mass Spectrometry
MALDI-TOF MS has been shown capable of rapidly and accurately characterizing bacteria. Highly reproducible spectra are required to ensure reliable characterization. Prior work has shown that spectra acquired manually can have higher reproducibility than those acquired automatically. For this reason, the objective of this study was to optimize automated data acquisition to yield spectra with rep...
متن کاملDiagnosis of the disease using an ant colony gene selection method based on information gain ratio using fuzzy rough sets
With the advancement of metagenome data mining science has become focused on microarrays. Microarrays are datasets with a large number of genes that are usually irrelevant to the output class; hence, the process of gene selection or feature selection is essential. So, it follows that you can remove redundant genes and increase the speed and accuracy of classification. After applying the gene se...
متن کاملPeptide Peak Detection for Low Resolution MALDI-TOF Mass Spectrometry.
A new peak detection method has been developed for rapid selection of peptide and its fragment ion peaks for protein identification using tandem mass spectrometry. The algorithm applies classification of peak intensities present in the defined mass range to determine the noise level. A threshold is then given to select ion peaks according to the determined noise level in each mass range. This a...
متن کاملOptimization of matrix assisted desorption/ionization time of flight mass spectrometry (MALDI-TOF-MS) for the characterization of Bacillus and Brevibacillus species
Over the past few decades there has been an increased interest in using various analytical techniques for detecting and identifying microorganisms. More recently there has been an explosion in the application of matrix assisted laser desorption ionization time of flight mass spectrometry (MALDI-TOF-MS) for bacterial characterization, and here we optimize this approach in order to generate repro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007